Versioning Data Is About More than Revisions: A Conceptual Framework and Proposed Principles

نویسندگان

چکیده

A dataset, small or big, is often changed to correct errors, apply new algorithms, add data (e.g., as part of a time series), etc. In addition, datasets might be bundled into collections, distributed in different encodings mirrored onto platforms. All these differences between versions need understood by researchers who want cite the exact version dataset that was used underpin their research. Failing do so reduces reproducibility research results. Ambiguous identification also impacts and centres are unable gain recognition credit for contributions collection, creation, curation publication individual datasets. Although means identify using persistent identifiers have been place more than decade, systematic versioning practices currently not available. this work, we analysed 39 use cases current across 33 organisations. We noticed term ‘version’ very general sense, extending beyond common understanding refer primarily revisions replacements. Using concepts developed software Functional Requirements Bibliographic Records (FRBR) conceptual framework, six foundational principles datasets: Revision, Release, Granularity, Manifestation, Provenance Citation. These provide high-level framework guiding consistent practice can serve guidance providers when setting up own revision protocols procedures.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Nurse Job Satisfaction: Is a Revised Conceptual Framework Needed?

Background and Objectives: Job satisfaction is a critical factor in attracting and retaining nurses. Although many studies have dealt with nurses’ job satisfaction, rapid transformation of the community and health systems can alter the factors influencing this issue, hence calling for continuous monitoring of job satisfaction as perceived by nurses. Built on this necessity, the present study wa...

متن کامل

Abortion is more than a debate about conscientious objection.

Pettinger M, et al. Women’s Health Initiative Investigators. Effects of estrogen plus progestin on gynecologic cancers and associated diagnostic procedures: the Women’s Health Initiative randomized trial. JAMA 2003; 290: 1739–1748. 28 Archer DF. The effect of the duration of progestin use on the occurrence of endometrial cancer in postmenopausal women. Menopause 2001; 8: 245–251. 29 de Vries CS...

متن کامل

Keratin 13 is a more specific marker of conjunctival epithelium than keratin 19

Introduction To evaluate the expression patterns of cytokeratin (K) 12, 13, and 19 in normal epithelium of the human ocular surface to determine whether K13 could be used as a marker for conjunctival epithelium. Methods: Total RNA was isolated from the human conjunctiva and central cornea. Those transcripts that had threefolds or higher expression levels in the conjunctiva than the cornea wer...

متن کامل

PI3K and mTOR inhibitor, NVP-BEZ235, is more toxic than X-rays in prostate cancer cells

Background: Radiotherapy and adjuvant androgen deprivation therapy have historically been the first treatment choices for prostate cancer but treatment resistance often limits the capacity to effectively manage the disease. Therefore, alternative therapeutic approaches are needed. Here, the efficacies of radiotherapy and targeting the pro-survival cell signaling components epidermal growth fact...

متن کامل

p63 is more sensitive and specific than 34βE12 to differentiate adenocarcinoma of prostate from cancer mimickers

Objective(s): Prostate cancer is the world’s leading cause of cancer and the second cause of cancer-related death in men after lung cancer. Differentiation of prostate adenocarcinoma from benign prostate lesions and hyperplasia sometimes cannot be done on the basis of morphologic findings. Considering the fact that in the prostate adenocarcinoma there is no basal cell layer, basal cell markers ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Data Science Journal

سال: 2021

ISSN: ['1683-1470']

DOI: https://doi.org/10.5334/dsj-2021-012